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A principle of hierarchical entropy maximization is proposed for generalized superstatistical sys- 
tems, which are characterized by the existence of three levels of dynamics. If a generalized super- 
statistical system comprises a set of superstatistical subsystems, each made up of a set of cells, 
then the Boltzmann-Gibbs-Shannon entropy should be maximized first for each cell, second for each 
subsystem, and finally for the whole system. Hierarchical entropy maximization naturally reflects 
the sufficient time-scale separation between different dynamical levels and allows one to find the 
distribution of both the intensive parameter and the control parameter for the corresponding su- 
perstatistics. The hierarchical maximum entropy principle is applied to fluctuations of the photon 
Bose-Einstein condensate in a dye microcavity. This principle provides an alternative to the master 
equation approach recently applied to this problem. The possibility of constructing generalized 
superstatistics based on a statistics different from the Boltzmann-Gibbs statistics is pointed out. 

PACS numbers: 05.20.-y, 05.30.-d, 05.70.-a, 67.85.-d 



I. INTRODUCTION 

Superstatistics represents a statistics of canonical 
statistics and allows one to consider stationary states of 
nonequilibrium systems with fluctuations of an intensive 
parameter ft [l[. Though usually considered as an in- 
verse temperature, ft can be interpreted in a more gen- 
eral way @, H[ . A superstatistical system comprises a set 
of subsystems, or cells, each having the Gibbs canoni- 
cal distribution determined by ft. An essential feature 
of the superstatistical system is sufficient spatiotemporal 
scale separation, so that ft fluctuates on a much larger 
time scale than the typical relaxation time of the local 
dynamics in a cell. Superstatistics can be given a basis 
by the theory of hyperensembles [3, |j| . 

The distribution of ft can be considered as a function 
of some additional control parameters @. However, in 
ordinary superstatistics, the intensive parameter fluctu- 
ates, but the control parameters are constant. Consid- 
ering the control parameter fluctuations has led very re- 
cently to the generalization of superstatistics — "statistics 
of superstatistics," or "generalized superstatistics" [7|. 
Generalized superstatistics is the statistics of generalized 
superstatistical systems. A generalized superstatistical 
system comprises a set of nonequilibrium superstatistical 
subsystems and can be associated with a generalized hy- 
perensemble, an ensemble of hyperensembles. Compared 
with an ordinary superstatistical system, a generalized 
superstatistical system is characterized by the existence 
of the third, upper level of dynamics in addition to the 
two levels of dynamics existing in each superstatistical 
subsystem. This is reflected in the existence of a fluctuat- 
ing vector control parameter on which both the intensive 
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parameter distribution and the density of energy states 
depend. Significantly, generalized superstatistics can be 
used for nonstationary nonequilibrium systems. It was 
applied to branching processes and pair production in a 
neutron star magnetosphere Q. 

The main problem of generalized superstatistics is the 
determination of the intensive parameter distribution, 
characterizing the superstatistical dynamics in each sub- 
system, and the control parameter distribution, charac- 
terizing the dynamics of the system as a whole. The aim 
of this paper is to develop the maximum entropy princi- 
ple that can be used to solve the above problem. 

The paper is organized as follows: In Sec. II the hier- 
archical maximum entropy principle for generalized su- 
perstatistical systems is formulated and the canonical, 
intensive parameter, and control parameter distributions 
are consecutively determined. In Sec. Ill this principle is 
applied to Bose-Einstein condensation of light and fluc- 
tuations of the number of ground-mode photons are con- 
sidered. In Sec. IV the main conclusions are given. 



II. HIERARCHICAL MAXIMUM ENTROPY 

A generalized superstatistical system is conveniently 
thought of as a set of superstatistical subsystems, each 
in turn made up of a set of cells. There are three levels 
of dynamics in this system: the first, lower level of fast 
dynamics in a cell, the second, middle level of supersta- 
tistical dynamics in a subsystem, and the third, upper 
level of global dynamics in the whole system. The levels 
are arranged in increasing order of dynamical time scale 
so that the shortest time scale corresponds to the lower 
level. The local dynamics in a cell is characterized by 
an energy E, the superstatistical dynamics in a subsys- 
tem is characterized by an intensive parameter ft, and the 
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global dynamics in the whole system is characterized by 
a control parameter £, which may be a multidimensional 
vector. 

The system hierarchy is formed as a result of the suf- 
ficient time-scale separation between different levels of 
dynamics. This allows us to formulate the maximum 
entropy principle for the generalized superstatistical sys- 
tem as a principle of hierarchical entropy maximization. 
More specifically, the entropy should be maximized first 
for each cell, second for each subsystem, and finally for 
the whole system. 



A. Local dynamics 

Though the existence of the Gibbs canonical distribu- 
tion at the lower dynamical level is postulated in super- 
statistics, it is reasonable to explicitly obtain this distri- 
bution from the maximum entropy principle. This trivial 
derivation will allow us to readily observe an analogy be- 
tween the dynamics at different hierarchical levels of a 
generalized superstatistical system. 

Choose a superstatistical subsystem of the generalized 
superstatistical system. A fixed value of the control pa- 
rameter £ corresponds to this subsystem, but the inten- 
sive parameter j3 may still fluctuate. Choosing the sub- 
system also fixes the density of energy states: 



dT(E\Q 
dE ' 



(1) 



where T(E\^) is the number of states with energy less 
than E. In integrals with dT(E\^), integration over E 
will be performed, dT(E\£) = g(E\£)dE. 

To consider the local dynamics, choose a cell of the 
subsystem. Then /3 also becomes fixed, but the energy 
E is not fixed and is characterized by a probability dis- 
tribution p(E\/3,£). To find the distribution maximizing 
the Boltzmann-Gibbs-Shannon entropy 



s[E]m) 



P (E\p,0\np(E\P,OdT(E\0 



under the normalization condition A[_E](/3|£) = 1 and 
the mean energy constraint U[E\(/3\£) — [/(/3|£), where 



N[E](m= J p(E\P,S)dT(E\S), 
U[E}(I3\0= I Ep(E\/3,0dT(E\0, 



we should consider the condition of zero variation, 8L\ = 
0, for the Lagrange function 

LxK/U) = S[E}(P\0-(^-l)N[E](m~PU[E}(m- 
Then we arrive at the Gibbs canonical distribution 

pg(e\p,o 



where 

Z(J3\S) = J e- pE dT(E\0 
is the partition function. The entropy is 

S[E}(/3\0 =1*0910+ WIO. 
where the mean energy 



U(f3\0 = 



8(3 



(2) 



(3) 



(4) 



(5) 



is expressed via the Massieu function 
^(/9|£)=hiZ(/9|£). 

B. Superstatistical dynamics 



Now consider the superstatistical dynamics of the cho- 
sen subsystem. This dynamics is characterized by the 
fluctuating intensive parameter ft that determines the 
properties of cells of the subsystem. To find the intensive 
parameter distribution /(/3|£), we should maximize the 
entropy of the joint probability distribution of E and /3, 
given £. It is written as [5|, |8(] 

s[e, 0\ (o = s[p\ (o + [s[e] (6) 



where 



S\PU) = ~ / /(/?|£)m/(/9|» 



(7) 



is the entropy associated with an d S[E](P\Q is 

given by Eq. The normalization condition for /(/3|£) 
is iV[/3](£) = 1, where 

mm) = / fmw. 

In addition, we may impose a set of n constraints given 
by an n-dimensional vector equality 



where 



M [0](fl = M(0, 



m[/9](o = / mimfmw, 



(8) 



(9) 



and m(/3|0 = [mi(/9|0, . . . , m n (/9|£)] and M(£) = 
[Mi(£), • ■ • , M n (£)] are n-dimensional vectors specifying, 
respectively, the form and values of the constraints. Each 
Mj(£) is the mean of mj(/3|£) over the fluctuating j3, 
given £. We consider M[(3]{£) as some general constraint 
vector, but it may be composed of the constraints used in 
ordinary superstatistics, e.g., the mean values of energy, 
entropy, square of entropy, energy divided by tempera- 
ture, or logarithm of the partition function [4], la, l9t [lOj . 
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Also define an n-dimensional vector Lagrange multi- 
plier p — (pi, . . . , p n ), where each pi is the Lagrange 
multiplier corresponding to the constraint Mi [/?](£) = 
Mi(£). We then have the following Lagrange function: 

£ 2 KM) = S[E,0\($ - {v 2 - 1)N[(3}(0 - p • M[/3](£). 

By a ■ b — ^2 aibi we denote the scalar product of some 
vectors a and b. The condition 8L 2 = yields the inten- 
sive parameter distribution 

/G%, = exp[-/x • m(^|0 + WIOL (10) 

where the partition function 

YM = J Z(f3\0eM-V-™m)+PU(mW (11) 

is determined from the normalization condition 
for/(/3|M,0- _ 

Note that f((3\p, £) and still depend on the 

Lagrange multiplier p. The implicit dependence of p on 
the control parameter £, 



is determined from 



M(0 



dp 



where 



£ 2 (/x,0 = In Y (p, 



(12) 



(13) 



(14) 



is the Massieu function and d/dp — (d/dpi, . . . , d/dp n ) 
is the n-dimensional gradient operator. Equations (|13[) 
and (|T4| are analogous to Eqs. ((4]) and ([5]), respectively. 
Thus, given the constraints ©, the intensive parameter 
distribution (fTU|) . partition function (dJ), and Massieu 
function (fl"4"|) depend only on /? and £: 

/(/9|0 = /G8KO.O, (is) 



nO = nA*(0,0. v 2 (0 = h(m,0- (16) 

We may either first set the constraint vector M (£) and 
then find from the maximum entropy principle, or 
vice versa. This is in full analogy with the case of the dy- 
namics in a cell, when we may first set the mean energy 
U(/3) and then find the corresponding intensive parame- 
ter /3, or set j3 and then find U(/3), which is more com- 
mon. Incidentally, this duality allows one to alternatively 
formulate superstatistics by introducing the fluctuations 
of U ((3) instead of those of j3 [ll| . Note that the control 
parameter £ has a more general nature than /?, since /? is 
exactly a Lagrange multiplier, while £, though controlling 
the Lagrange multiplier p, may not coincide with p. The 
analogy between j3 and £ will be complete if we choose 

m = e 



It follows from Eqs. ©, ©"CGI, and ([iljl-pl) 

that the entropy associated with the superstatistical sub- 
system is 



S[E,p}(0=v 2 (0+»(0-M(0- 



(17) 



It is analogous to Eq. ([3]). 

Thus, the intensive parameter distribution for the su- 
perstatistical subsystem is given by Eq. (H~5l) . The super- 
statistical distribution 



p{E\0 = / p a (E\MW\Od0 



has the form 



P(E\0 = 



no 



exp{-P[E-U(l3\0}-rt0MP\0W, 

(18) 

with the normalization condition J p(E\£)dT(E\£) = 1. 

Ordinary superstatistics is a special case of general- 
ized superstatistics: an ordinary superstatistical system 
is a generalized superstatistical system without fluctua- 
tions of the control parameter £. Therefore, we can easily 
obtain the intensive parameter distribution / = f(/3\p) 
for this system by formally removing £ from Eq. ()10|) and 
from subsidiary Eqs. ©, ©, ©, ©, ©, ©, @TJ, tH, 
and (| 14JK It is consistent with the distributions obtained 
earlieTjaBlli- 



C. Global dynamics 

Consider the third level of dynamics. We should find 
the probability distribution c(£) of the fluctuating control 
parameter £. This distribution is normalized, N[£] = 1, 
where 



N[Z] 



The entropy of the joint probability distribution of E, /3, 
and £ is determined by analogy with the entropy associ- 
ated with a superstatistical subsystem [cf. Eq. (j6))] : 

S[E, /3, £] = + J S[E, ff\ (0c(0d£, (19) 



where 



c(01nc(0de 



(20) 



is the entropy associated with the control parameter dis- 
tribution c(£), and /?](£) is given by Eq. ([TT|). We 
may impose a set of m additional constraints by analogy 
with Eqs. © and ©: 



where 



K[£] = K, (21) 
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and fc(£) = ■ • • , ^m(C)] and ^ = ■ • ■ > K m) are 

TO-dimensional vectors specifying, respectively, the form 
and values of the constraints. Each JQ is the mean of 
ki(£) over the fluctuating £. 
The Lagrange function is 

LsfaK) = S[E,P,£] - (u 3 - i)N[£] - k • K[£], 

where we have defined an m-dimensional vector La- 
grange multiplier k — . . . , /c m ), where each is 
the Lagrange multiplier corresponding to the constraint 
Ki[£] = Ki. The condition 6L3 = yields the control 
parameter distribution 



c(£, k) 



no 



exp[-K-k(0 + n(0-M(Z)}, (22) 



X{k) 

where the partition function is 

X(k) = [ Y(0 exp[- K • k(0 + M (0 • M(0]d£, 



and Y*(£) is defined by Eq. (|16p . By analogy with 
Eq. (fl~3|) . we can rewrite the constraints ([21]) as follows: 



K 



dn 



(23) 



where 



vs{k) = lnX(«) 



is the Massieu function, and d/dn = {d/dni, . . . , d/dn m ) 
is the m-dimcnsional gradient operator. It remains to 
find the entropy (|19p at the maximum point [cf. Eqs. Q 
and©]: 

S[E,/3,£] = + 

Thus, the intensive parameter distribution c(£) = 
c(£, k) is given by Eq. (f2"2"j) . with the Lagrange multi- 
plier k determined from Eq. (f2"3"|). By Eqs. {TSJ and ([2"2"|) . 
we get that the generalized superstatistical distribution 

a{E) = J p(E\0g(E\0c{m 



has the form 
1 



a(E) 



X(k) 



exp{-/3[£ - U(fi\$] 



-MO • - M (0] - « • k(0}s(^IO<#de» 

with the normalization condition J a(E)dE = 1. 



III. BOSE-EINSTEIN CONDENSATION OF 
LIGHT 

Recently, thermalization of light in a dye microcavity 
has been observed [HI]. In this experiment, photons are 
confined in a curved-mirror optical microresonator filled 



with a dye solution. In the microresonator, absorption 
and reemission of photons by dye molecules results in 
thermalization of the photon gas. Since the free spec- 
tral range of the microresonator is comparable to the 
spectral width of the dye, the emission of photons with 
a fixed longitudinal number dominates. Therefore, the 
photon gas is effectively two dimensional, and thermal- 
ization of transverse photon states occurs. Moreover, 
Bose-Einstein condensation (BEC) of light has been ex- 
perimentally observed in the described system [l3|, E3] ■ 
This reflects the fact that a two-dimensional harmoni- 
cally tra ppe d ideal gas of massive bosons can undergo 
BEC [TMlj]. In the case of the light BEC, the curvature 
of the mirrors provides a nonvanishing effective photon 
mass and at the same time induces a harmonic trapping 
potential for photons. 

The problem of thermalization and fluctuations of the 
photon Bose-Einstein condensate has been considered 
very recently in Ref . (20( . The condensate exchanges ex- 
citations with a reservoir consisting of M dye molecules. 
The authors assume that the ground-state photon mode 
is coupled to the electronic transitions of a given number 
of dye molecules. This means that the sum X of the num- 
ber of ground-mode photons, n, and that of excited dye 
molecules, X — n, is constant. To analyze this system, 
the authors use the master equation approach. 

Note that if we are interested in the behavior of the 
fluctuating photon BEC after thermalization has oc- 
curred, we can obtain the corresponding probability dis- 
tribution merely using the thermodynamic consideration. 
The population of the electronic states of dye molecules 
is quickly thermalized, with the characteristic time ~1 ps 
at room temperature (see Refs. pH - (25l | for details). Since 
the typical fluorescence lifetime is ~1 — 10 ns, the emis- 
sion of photons occurs from thermally equilibrated ex- 
cited states. This apparent time-scale separation allows 
us to consider the above system as a generalized super- 
statistical system. Therefore, we can find the limiting 
probability distribution of the number of ground-mode 
photons by directly applying the hierarchical maximum 
entropy principle to this system. 

For simplicity, consider the case of the ground-mode 
coupling and neglect the twofold polarization degeneracy 
by analogy with Ref. [2(| ■ The whole system is then com- 
posed of two subsystems: the subsystem of the dye solu- 
tion and the subsystem of the photon BEC. The control 
parameter characterizing the interaction of the subsys- 
tems is the fluctuating number of ground-mode photons, 
n. The subsystem of the dye solution in turn consists of 
M dye molecules, among which there are X — n excited 
molecules and M — X + n ground-state molecules. Ob- 
viously, ^ n ^ X ^ M. Each molecule is in contact 
with a solvent, which plays the role of thermostat. In 
this sense, dye molecules resemble cells, but the inverse 
temperature /? does not fluctuate. For /(/3), this formally 
corresponds to the conditions of normalization, a given 
mean, and zero variance. In what follows, we will not 
explicitly indicate the dependence of functions on f3. 
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Let Dq(so) and Di(e±) be the density of rovibrational 
states for the ground, So, and first excited, Si, singlet 
electronic state, respectively. Note that £j = E — Ei, 
where Ei is the lowest-energy substate of Si, where i — 
0, 1. Hence, Di(e) — for any e < 0. The partition 
functions Zq and Z\ corresponding, respectively, to the 
ground-state and excited dye molecules are 



Zi = e 



where 



w, 



/>oo 

/ e- pe Di{E)de, 
Jo 



0,1. 



It follows from Eqs. ©-([5]) and (l24l) that the entropy for 
a ground-state molecule, sq, and for an excited molecule, 



Si, is 



where 



Inwi + [3{ui - Ei), 



1 dwi 

m = Ei — 

uii dp 



(25) 



is the corresponding mean energy. 

Now consider the subsystem of all dye molecules. After 
enumerating them and denoting a ground-state molecule 
by and an excited molecule by 1, we can write an M- 
digit binary number r\ — {j]\T]i ■ . - T]m) with M — X + 
n zeros and X — n unities such that the state of the 
kth dye molecule is given by the fcth digit rjk- For any 
given r\, the entropy of the corresponding combination of 
dye molecules is 

s v \ n = (M — X + n)s + (X - n)sx. 

The probability that r\ takes on a fixed value is 



M 

Pv\n - [x-n 



(X - n)\(M - X + n)\ 
Ml ' 



The entropy of the subsystem of dye molecules is cal- 
culated using the discrete analogs of Eqs. (J6j> and ((7J, 
with S[E](f3\£) and /(/3|£) replaced by s^|„ and p v \ n , re- 
spectively: 



In 



M 



X 



The mean energy of the subsystem is 

?4 = (M - X + n)u + (X - n)u x , 

where uq and u\ are defined by Eq. (|25p . 

The entropy of the photon BEC is zero, sf^ — 0, since 
the absence of the polarization degeneracy is assumed. 
The total energy of the condensate is 



where huj is the energy of a ground-mode photon. 

Finally, consider the system as a whole. The control 
parameter n corresponding to the number of ground- 
mode photons is characterized by a normalized discrete 
probability distribution (ttq, ■ ■ ■ , t^x), where 7r„ is the 
probability of n photons. For a fixed n, the energy and 
entropy of the system are given by U n — + and 
S n = s « + sP h , respectively. Maximizing the entropy 



( 24 ) [see Eqs. CDS) and ® ] 



A" 



A" 



S = - ^ 7T„ In 7T n + 22 n nS„ 



n=0 n=0 

under the normalization condition 

A 



J2 nn = 1 



(26) 



and the mean energy constraint ^Tt n U n = U yields 



(27) 



1 / M 
Z\X-n)' b " 

x exp{-f3[(M - X + n)E + (X - n)Ei + nhui}}, 



where Z is determined from Eq. (|26|). Dividing Eq. ([27]) 
by ttq and writing huo = Ei—Eq, we obtain the probabil- 
ity distribution of the number of ground-mode photons 
in the form 



Xl(M-X) 



7TQ 



(X - n)l(M - X + n)\ \wij 



\ n 

W \ -j3nh{u 



-u ) 



(28) 



and 



This equation allows us to find 7Tq = (^7r n /7To) _1 
then calculate ir n for all positive n ^ X. 

Thus, the long-run behavior of the photon BEC, when 
the probability distribution (itq, . . . ,irx) becomes sta- 
tionary, can be investigated using the hierarchical max- 
imum entropy principle. The link with the result of the 
master equation approach can be readily observed via the 
Kennard-Stepanov law (20l. [26l430l| . 



(29) 



B x(oj) wi 



which relates the Einstein coefficients for stimulated 
emission, Biq(lu), and absorption, J5oi(w). Equation ()29j) 
allows us to rewrite Eq. ([2"5]) as 



X\{M -X)\ 



7T0 



[X - n)\{M - X + n)\ 



Bio(w) 



B m {uj) 



nftw. 



which is identical to Eq. (10) of Ref. . 

It seems interesting to use the described approach for 
studying the photon BEC fluctuations in more detail, 
e.g., for considering a more realistic situation of the po- 
larization degeneracy and additional fluctuations of M 
and X. 
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IV. CONCLUSION 

I have formulated the hierarchical maximum entropy 
principle for generalized superstatistical systems. Such 
systems comprise a set of nonequilibrium superstatistical 
subsystems, where each subsystem is made up of many 
cells, and are characterized by the three-level dynami- 
cal hierarchy formed as a result of the sufficient time- 
scale separation between different dynamical levels. By 
arranging these levels in increasing order of dynamical 
time scale and consecutively maximizing the entropy at 
each level, I have obtained first the Gibbs canonical dis- 
tribution for each cell, second the intensive parameter 
distribution for each subsystem, and finally the control 
parameter distribution for the whole system. From these 
distributions, I have also found the superstatistical dis- 
tribution for each subsystem and the generalized super- 
statistical distribution for the whole system. 

I have applied this principle to Bose-Einstein condensa- 



tion of light in a dye microcavity. Assuming the ground- 
mode coupling and neglecting the polarization degener- 
acy, I have obtained the long-run probability distribution 
of the fluctuating number of ground- mode photons. This 
distribution is consistent with the analogous result of the 
master equation approach. 

Note that when the hierarchical maximum entropy 
principle is applied to a generalized superstatistical sys- 
tem, certain constraints should be imposed on a nor- 
malized distribution to obtain the canonical distribution 
at the lower dynamical level. However, the constraints 
imposed on the intensive and control parameter distri- 
butions may be quite general. I propose erasing such 
a distinction, viz., choosing some general constraints at 
the lower dynamical level and additionally considering 
a vector intensive parameter. This will result in the 
generalized superstatistics the local dynamics of which 
is described by a more general statistics than the usual 
Boltzmann-Gibbs statistics. Grand canonical statistics 
may be the simplest alternative. 
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